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Abstract 

We study gossip algorithms for the rumor spreading 
problem which asks each node to deliver a rumor to 
all nodes in an unknown network. Gossip algorithms 
allow nodes only to call one neighbor per round and have 
recently attracted attention as message efhcient, simple 
and robust solutions to the rumor spreading problem. 

A long series of papers analyzed the performance of 
uniform random gossip in which nodes repeatedly call a 
random neighbor to exchange all rumors with. A main 
result of this investigation was that uniform gossip com- 
pletes in 0(^^§^) rounds where $ is the conductance 
of the network. More recently, non-uniform random 
gossip schemes were devised to allow efficient rumor 
spreading in networks with bottlenecks. In particular, 
[Censor-Hillel et al., ST0C'12] gave an O(log^n) algo- 
rithm to solve the 1-local broadcast problem in which 
each node wants to exchange rumors locally with its 
1-neighborhood. By repeatedly applying this protocol 
one can solve the global rumor spreading quickly for 
all networks with small diameter, independently of the 
conductance. 

All these algorithms are inherently randomized in 
their design and analysis. A parallel research direction 
has been to reduce and determine the amount of ran- 
domness needed for efficient rumor spreading. This has 
been done via lower bounds for restricted models and by 
designing gossip algorithms with a reduced need for ran- 
domness, e.g., by using pseudorandom generators with 
short random seeds. The general intuition and consen- 
sus of these results has been that randomization plays a 
important role in effectively spreading rumors and that 
at least a poly logarithmic number of random bit are 
crucially needed. 

In this paper we improves over this state of the 
art in several ways by presenting a deterministic gossip 
algorithm that solves the the fc-local broadcast problem 
in 2{k + log n) log n roundfj^ Besides being the first 
efficient deterministic solution to the rumor spreading 
problem this algorithm is interesting in many aspects: 
It is simpler, more natural, more robust and faster than 
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^Throughout this paper logx denotes [log2a;], that is, the 
rounded up binary logarithm. 



its randomized pendant and guarantees success with 
certainty instead of with high probability. Its analysis 
is furthermore simple, self-contained and fundamentally 
different from prior works. 

1 Introduction 

Broadcasting, that is, disseminating information 
present initially at different nodes in an unknown net- 
work to every node, is a fundamental network commu- 
nication primitive with many applications. It has been 
studied under different names such as gossip, rumor 
spreading, information dissemination, (all-to-all) mul- 
ticast or (global) broadcast. 

Gossip algorithms during which nodes contact or 
call only one neighbor at a time have been proposed 
as a powerful time and message efficient alternatives 
to flooding, i.e., repeatedly forwarding information to 
all neighbors, or structured broadcast protocols which 
often require a stable network with known topology. 

The simplest and most widely studied form of gos- 
sip is uniform (random) gossip in which nodes repeat- 
edly call a random neighbor to exchange information. 
A series of results showed that this algorithm per- 
forms well on well-connected graphs with no bottle- 
neck(s) [TTl (THl |5J O [T^. More precisely, the main re- 
sult is a tight bound of 0(^^§^) rounds, where $ is 
the conductance of the network. More recently, non- 
uniform random gossip schemes were devised to allow 
efficient rumor spreading in arbitrary networks [H |3]. 
The local broadcast problem, that asks each node to ex- 
change rumors locally with all its neighbors, has been a 
crucial abstraction to obtain results independent from 
any conductance-type measure. In particular, building 
on the results on uniform gossip it was shown in [3] 
how to solve the local broadcast problem in 0(log'^ n) 
rounds. Repeatedly applying this solution leads to an 
0{D\og^ n) global broadcast in any network with di- 
ameter D and thus to a polylogarithmic time gossip so- 
lution for any network with polylogarithmic diameter. 
Using connections to spanners one can furthermore get 
a 0{D + log'^^^^) solution. 

All these algorithms are inherently randomized in 
both their design and analysis in that they crucially 
rely on the effect that choosing neighbors randomly for 



forwarding disperses information quickly in expanders 
or expanding subgraphs. A parallel research direction 
to finding faster and more general gossip algorithms has 
been to study the necessity for this randomization. In 
particular, there are both lower bounds quantifying how 
much randomness is inherently needed for efficient gos- 
sip algorithms [H [Ml [16] and newly designed gossip pro- 
tocols that work with reduced amounts of randomness 
[HI M [ini [S] ■ The general intuition and also the consen- 
sus of these works has been that (some) randomization 
plays a crucial role in effectively spreading rumors. 

1.1 Our Results In this paper we contribute to both 
of these research directions by presenting a fast, simple, 
natural, robust and deterministic gossip algorithm for 
the local broadcast problem: 

Theorem 1.1. For any k there is a simple determin- 
istic gossip algorithm that runs for 2(fclogri -t- log^ n) 
rounds on any n-node network and solves the k-local 
broadcast problem, that is, allows each node to exchange 
a rumor with each node at distance at most k. 

The next corollary shows that this directly implies 
a fast and simple algorithm for the global broadcast 
problem as well. We remark that using the connection 
to spanners from |3] one can also obtain a (significantly 
less simple) deterministic gossip algorithm solving the 
global broadcast problem in 0{D + log*^^^-* n) rounds, 
given that there exists a deterministic variant of the 
spanner construction from |17j . 

Corollary 1.1. There is a simple deterministic gos- 
sip algorithm that runs for 2(£)logn -|- log^ n) rounds 
on any n-node network with diameter D and solves the 
global broadcast problem. 

Certainly, the most striking aspect of Theorem |1.1| 
is that it constitutes the first deterministic gossip algo- 
rithm for the broadcast setting studied here and in [4l[3]. 
We feel that this is a very interesting and surpris- 
ing result given the general intuition and the results 
from [71 [131 [ini US] that at least some randomness is 
needed to enable an efficient gossip algorithm. We em- 
phasize that our result does not stem from unusual 
model assumptions. Exactly as in, e.g., [H [3] we only 
assume that (1) each node only knows the IDs of its 
neighbors and (2) two nodes involved in a call can ex- 
change all rumors known to them. 

Our algorithm has several other advantages. For 
one it is faster than previous randomized algorithms: 
For k = 1 our algorithm is logarithmically faster 
than [5] while for k = log n its 4 log^ n running time 
is a 0(log^ 7i)-factor and therefore even quadratically 



faster. Even when compared to the performance of 
uniform gossip it is at most a log n- factor slower on 
expanders while continuing to be near optimal on any 
other topology. Furthermore, any faster (randomized) 
algorithm with running time 0(log^~'^n) for k — logn 
would imply the existence of spanners with better 
than known quality (see also conclusion). Lastly, its 
deterministic nature brings with it the advantage that 
these running time guarantees (and the correctness 
property) hold with certainty instead of with high 
probability. 

Our algorithm is also simpler, more natural and 
more robust. In fact, one of our algorithms is a 
simplification of [5] in which furthermore all randomized 
choices are replaced by arbitrary ones. While our 
analysis is powerful enough to work in this more general 
setting it remains simple, short and self-contained. It 
also extends nicely to analyzing a wide variety of related 
natural processes demonstrating the robustness of both 
algorithm and analysis. 

1.2 Related Work Gossip and rumor spreading has 
been intensely studied both for the setting of a single 
rumor being spread and for a rumor being spread from 
each node in the network. A difference between the 
two settings becomes mostly apparent when the amount 
of information exchanged between nodes in a round is 
limited. In such a scenario very different techniques like 
algebraic gossip [15] become interesting. In this paper 
we assume large packet sizes that allow two nodes to 
exchange potentially all rumors in one packet. In this 
setting it typically does not matter whether one or more 
rumors are to be spread. 

The spreading of a rumor according to the uniform 
gossip process was first considered by Frieze and Gim- 
met [11] and subsequently Pittel [19] which proved that 
only (1 -|- In 2) log n -\- 0{\) rounds are needed on the 
complete graph. Lower bounds and non-uniform algo- 
rithms for the complete graph were investigated by [16] . 
Going to more general topologies [6] showed that uni- 
form gossip works well in any expander. More precisely 
an 0( '°|6" ) spreading time was proven for any graph 
with conductance This bound was then improved to 
Q( iog^ ^-^ logn ^ [5] and finally 0(i2|^i) ^12,. This bound 
is tight since for any <I> = VL{l/n) there are graphs 
with conductance on which uniform gossip requires 
©(^^$^) rounds. More generally, uniform gossip can be 
easily seen to perform as good or bad as the worst bot- 
tleneck in the network. 

Several papers have showed improvements over the 
uniform gossip protocol that cope better with bottle- 
necks. In [1] a rumor spreading algorithm was pro- 



vided for any graph with good weak expansion which 
includes some graphs with (few) bottlenecks. The first 
algorithm to work efficiently on any topology was [3] . As 
already summarized in the introduction this paper gives 
a 0(log'^ n) algorithm for the local broadcast problem as 
a crucial step to achieve a polylogarithmic running time 
on any network with polylogarithmic diameter. For all 
algorithms up to then there existed networks with di- 
ameter O(logn) on which gossip took Vt{n) time. 

The other research direction that is relevant for 
this paper aimed at determining and reducing the 
amount of randomness required by efficient gossip algo- 
rithms. A very interesting and successful way to reduce 
the amount of randomness is the quasirandom rumor 
spreading process of [F' . This protocol assumes an arbi- 
trary cyclic ordering of neighbors at each nodes but ran- 
domizes it by picking a random random starting point. 
It was shown that this simple algorithm achieves a sim- 
ilar performance as the fully random uniform gossip for 
many topologies like the complete graph [TOl [1] , random 
graphs, hypercubes or expanders The quasirandom 
rumor spreading process only requires O(logn) bits of 
randomness per node to pick the starting position. In [7| 
it was shown that one cannot further reduce this amount 
without a severe loss of efficiency. In particular if one 
uses o(log n) random bits to choose uniformly at random 
between a subset of equidistant starting points then the 
number of rounds becomes almost linear instead of log- 
arithmic [7]. The question how much randomness suf- 
fices for gossip to be efficient was also addressed in [14] . 
This paper presents an algorithm that uses a total of 
n log log n bits of randomness, gives a non-constructive 
argument for the existence of a gossip algorithm with 
roughly 2 log n bits of randomness and shows that no 
algorithm from a natural class of gossip algorithms can 
use less than logarithmic amount of randomness without 
taking roughly linear time. Algorithms that use pseudo- 
random generators or hashing schemes require only one 
random seed of polylogarithmic length and thus al- 
most achieve this low total amount of randomness. 

Organization The rest of the paper is organized as 
follows: In Section[2]we first formally define the network 
model and the (local) broadcast problem. In Section [s] 
we give a randomized local broadcast algorithm that 
simplifies the algorithm from In Section |4] we 

further simplify this algorithm and give a novel, and 
more powerful analysis that shows that this algorithm 
remains correct and efficient when all random choices 
are replaced by arbitrary deterministic choices. In 
Section [5] we show how to achieve further speed ups 
and obtain Theorem |1.1[ Lastly, in Section |6] we show 
that our algorithm can be seen as a very natural process 
with many desirable robustness properties. 



2 Gossip Algorithms and the Local Broadcast 
Problem 

In this section we define gossip protocols, the class of 
communication algorithms we are interested in, and the 
global and local broadcast problems. 

2.1 Gossip Algorithms We study gossip proto- 
cols, that is, synchronous communication algorithms in 
which in each round each node calls (at most) one neigh- 
bor for a bidirectional information exchange. This type 
of algorithm fits the uniform PUSH-PULL gossip pro- 
tocols that have been widely studied and matches algo- 
rithms allowed in the GOSSIP model that was defined 
in [3 as a restriction of the standard LOCAL model for 
distributed computing. The setting is given as follows: 

Netv^rork: 

A network is specified by an undirected graph G — 
{V, E) with node set V and edge set E. We denote the 
number of nodes with n = \V\^ the number or edges 
with m = \E\ and the diameter of G with D. For every 
node V £ V we define the neighborhood Tg{v) to be 
the set of nodes whose distance to v is at most one 
(including itself). Similarly, we define T^Qiv) to be all 
nodes of distance at most k from v and call this the 
fc-neighborhood. We omit the subscript if the graph is 
clear from the context. 

Communication: 

Nodes communicate in synchronous rounds t € 
{0, 1, . . .}. In each round t each node chooses a message 
and an incident edge. We denote the union of the 
selected edges in a round t with Et and the resulting 
graph with Gt = {V,Et). After this selection process 
for any node v the message of v is delivered to all 
nodes w S Tctiv), that is, all nodes that contacted v or 
were contacted by w. In short, we allow every nodes to 
initialize one call or bidirectional message exchange per 
round. 

We assume that the cost of communication is com- 
pletely covered by the caller, i.e., the node that initial- 
izes a contact. This leads to merely a constant cost for 
communication per node and round. 

Similar to the LOCAL model the GOSSIP model 
of j3] does in principle not limit the complexity of local 
computations or the size of the messages. This allows a 
node to always exchange all rumors it knows. Indeed, 
all messages sent by a node during our protocols will 
simply consist the collection of rumors it knows. For 
any node v we denote this collection with R^. 

Initial Knov^rledge: 

We assume that each node has a unique identifier 
(UID). Each node initially solely knows its UID and the 
UIDs of its neighbors. It is important that the network 



topology G is unknown to the nodes and we assume that 
no further knowledge about the network is known to the 
nodes. 

Important Remarks: 

• We will assume that in the beginning nodes attach 
their UID to their rumor. In this way, each node 
can easily tell which of its neighbors' rumors it has 
already (indirectly) received. This also allows us to 
identify a node, its rumor and its UID and interpret 
the set of rumors known to node w as a set of 
neighboring nodes it has (indirectly) heard from. 

• While the GOSSIP model allows arbitrary local 
computations and message sizes our algorithms do 
not exploit this freedom: The local computations 
are extremely simple and while the set of rumors 
sent in a message can reach a size of Q{n) in the 
worst-case, this is optimal up to a small polyloga- 
rithmic factor considering that in dense graphs the 
total amount of information learned by nodes dur- 
ing a 1-neighborhood exchange is of order ri(n^) 
while only 0(n polylog n) messages are exchanged. 

• The UID's assumed in our model are not used for 
symmetry breaking but solely to allow nodes to talk 
about other nodes. The only operations that are 
used on the UIDs is that they are routed through 
the network with their rumor and compared for 
equality to decide whether a node has already heard 
(indirectly) from one of its neighbors or not. 

2.2 The Rumor Spreading and Local Broadcast 
Problem The classical problem to be solved by gossip 
algorithms is the following global broadcast problem: 

Definition 2.1. ((Global) Broadcast Problem) 
Each node v starts with one rumor r^ and the task is 
to inform every node about all rumors. 

In this paper we mostly focus on solving the fc-local 
broadcast problem which is an important refinement of 
the global broadcast problem. 

Definition 2.2. (fc-LocAL Broadcast Problem) 
In the k-local broadcast problem each node v starts with 
one rumor r^ and the task is for each node v to learn 
all rumors r^ of nodes u G T^{v) in its k -neighborhood. 

There are several motivations to introduce and study 
the fc-local broadcast problem: 

• It generalizes the global broadcast problem. In 
particular the global broadcast problem is equiva- 
lent to the n-local broadcast problem or the fc-local 
broadcast problem for k > D, where D is the di- 
ameter of the network. 



• As explained before, the local-broadcast problem 
has been proven crucial as a subproblem to solve 
the global broadcast problem in general topologies 
with bottlenecks. 

• The local broadcast problem composes nicely. In 
particular, any 0{T) algorithm for the fc-local 
broadcast problem also gives an 0(Z • T) algorithm 
for the I ■ fc-neighborhood exchange problem for any 
integer I > 1: Simply repeat the broadcast I times. 

• The 1-local broadcast problem and the (log*^ n)- 
local broadcast problem are natural distributed 
communication problems in their own right: Es- 
pecially in distributed settings with large networks 
it is realistic that nodes are only interested in suf- 
ficiently local information. 

• It was observed in [3] that the 1-local broadcast 
problem corresponds to one communication step in 
the LOCAL model. Thus with a T-round gossip 
algorithm for the fc-local broadcast problem any T'- 
round distributed algorithm in the LOCAL model 
can be simulated by an • T'] gossip algorithm. 
Many distributed problems have 0{\og'^ n) LOCAL 
algorithms, that is, can be solved by each node 
knowing only its O(log'^n) neighborhood. This 
reinforces the importance of the 0(log^ n)-local 
broadcast problem. 

Remark: The GOSSIP model does not initialize nodes 
with any non-local knowledge about the network in 
particular nodes do not know the network size n. For 
the 1-local broadcast this is not a crucial assumption. 
Indeed, since it is easy to verify locally whether a 1-local 
broadcast completed nodes can simply guess an upper 
bound on n and square their guess if the algorithm 
does not complete with their guess. For algorithms 
with a polylogarithmic running time squaring the guess 
increases the running time by a constant factor and 
the total running forms geometric sum which is only 
a constant factor larger than the final execution with 
the fist correct upper bound for n. For randomized 
algorithms the same verification/restart strategy can be 
used to transform any Monte Carlo algorithm to a Las 
Vegas algorithm. 

3 A Simpler Randomized 1-Local Broadcast 
Gossip Algorithm 

In this section we give a simple randomized gossip al- 
gorithms for the 1-local broadcast problem. The al- 
gorithm can be seen as a simplification of the algo- 
rithm given in [3] . To analyze this algorithm we use the 
methods of [3] namely a decomposition of any graph 



into expanders and the efficiency of random gossip in 
expanders. In Section |4] we will then further strip 
down this algorithm and show that replacing its ran- 
dom choices by arbitrary deterministic choices does not 
affect its correctness or efficiency. 

3.1 Round Robin Flooding The gossip algorithm 
in this section uses a simple round robin flooding 
subroutine which we introduce here first. 

Suppose all nodes have established links to at most 
A neighbors. It is quite straight forward to flood 
information along these links in A rounds by each 
node exchanging information over its links one by one. 
Essentially repeating this d times floods messages for 
d-hops along all established links in dA steps. For 
completeness we add the exact statement and algorithm 
for this flooding procedure next: 

Lemma 3.1. Suppose each node v knows the rumors 
and has selected A„ links to nodes ri^(l), . . . , ri^(At,). 
Suppose also, the distance d and an upper bound of A 
on max„ A„ is given to every node. 

Then, Algorithm 1 spreads each rumor for d hops along 
the selected links in Ad rounds. 

That is, each node v knows exactly the rumors in 
UuGT'' (v) after termination where G' — (V, E') is 

the undirected graph with E' = (J {{v,n^{i)}} . 

t),i<A„ 



Algorithm 1: Flood (Round Robin) 
(Input: max. deg. A, own deg. A„, distance d, 
neighbors n,u(l), . . . ,n^(A„), rumors Ry) 

REPEAT d times 
i?' = 

FOR t = 1 to A 
IF t < A^ THEN 

exchange rumors in R^ with 
ELSE wait 

add all received rumors to R' 

Ry = Ry U R' 



Proof. [Proof of Lemma 3.1 We denote with the 



set of tokens known to node v at the beginning of 
iteration i. The rumors collected in R' during iteration 
i by node v are exactly the rumors exchanged with 
nodes neighboring v in G" since for each undirected link 
{u, v} € E' either w or m initializes a bidirectional rumor 
exchange. We thus get that for any node v and any 
iteration i we have Ry{i + 1) = U«grG/(t)) -^«(*)- Note 



furthermore, that for every v and every k we also have 
r^t^w) = U„erG/(«)rG'(")- Now, using induction on 
the number of iterations with these two statements we 
directly obtain that Rv{d) = Uuer'' («) asserted. 

3.2 A Simple Randomized 1-Local Broadcast 
Gossip Algorithm In this part we present a simple 
randomized gossip protocol. The algorithm and even 
more its analysis are inspired by |3j but are arguably 
simpler and more natural. To further simplify the 
presentation we did not optimize the running time of 
the algorithm presented here. 

The randomized gossip protocol does the following 
for each node v ^ V in parallel: 



Algorithm 2: Randomized Gossip 

Ry — V 

WHILE TGiv)\Ry ^% 

pick 0(log^ n) random edges to T{v) \ Ry 
d = 0(log^ n); E' — all (newly) picked edges 
Flood rumors in Ry along _E'-edges for d-hops 
add all received rumors to Ry 



It is clear by construction that Algorithm 2 cor- 
rectly solves the 1-local broadcast problem since a node 
continues contacting its neighbors until it has received 
the rumors from all of them. The next lemma proves 
that with high probability Algorithm 2 is furthermore 
very efficient. 

Lemma 3.2. With high probability Algorithm 2 takes at 
most 41ogn iterations and solves the 1-local broadcast 
problem in 0(log^ n) rounds ( or 0(log^ n) rounds if only 
newly picked links are used during the flooding). 



To proof Lemma |3.2| we need the following two 
lemmas about expanders (for the definition of the 
expansion used here we refer to [3]): 

Lemma 3.3. (Lemma 3.1 in pj]) Every graph can be 
partitioned into disjoint node subsets such that: 

• Any subset forms a log n)- expander (when 
adding edges leaving the subset as self-loops). 

• At least a third of all edges are intra-partition edges, 
i.e., both endpoints lie in one subset of the partition. 

Lemma 3.4. Let G be an n-node graph and expansion 
$ and let T — ri(^^^J^). Suppose for every vertex v 
we uniformly sample T neighboring edges ( with replace- 
ment) and let G' be the subgraph of G consisting of the 
union of all selected edges. With high probability G' has 
diameter at most T . 



Proof. This follows directly from the result of [12] that 
uniform gossip solves the global broadcast problem in 
G in T steps. To see this we note that G' can be seen 
as the graph of edges initiated during such a run of 
the uniform gossip protocol. Furthermore, since each 
message travels at most one step in each round and 
every node learns about all messages in T steps the 
graph G' has diameter at most T. 

The result of [12\ which we used to prove 



Lemma 3.4 was also used in 3\ Interestingly, it seems 
much stronger than Lemma |3.4| and we suspect that 
Lemma |3.4| itself can be proved using simpler methods. 
One way would be to use sparsification results to show 
that under the specified subsampling any <I>-expander 
maintains its expansion and thus also its diameter of 

o(T'). 

We are now ready to proof Lemma |3.2[ 



Proof. Note that since hooding for d hops is symmetric 
we have at the beginning of any iteration i that u has 
not heard from v yet if and only if v has not heard 
from u. In this case we say the edge {u,v} g G is 
active and we denote the graph of active edges at the 
beginning of iteration i with Ki. To prove that logn 
iterations are sufficient we will show that with high 
probability the number of active edges decreases by a 
factor of 2/3 in every iteration. For this, we apply 
Lemma |3.3| on Ki to get a partitioning of nodes into 
subsets that induce $ = 0(1/ logn) expanders. The 
sampling of Q(\og^ n) uniformly random new neighbors 
in Algorithm 1 now directly corresponds to subsampling 
each of these expanders in the same way as described 



in Lemma |3.4| From Lemma 3.4 we thus get that with 
high probability the distance between any two nodes in 
the same partition along the newly established links is at 
most 0(logri/$) = 0(log^ n). With high probability we 
thus get that any intra-partition edge becomes inactive 
after flooding for 0(log^ n) hops along the (newly) 
selected links. Since Lemma |3.3| guarantees that at 
least a third of the active edges are intra-partition edges 
we have established that with high probability at most 
— log2/3 < ^ fog n many iterations are needed. To 
determine the total running time we note that each 
node established at most 0(log^ n) new links in each 
iteration. The number of total links established by 
a node is thus at most O(log^ri). This means we 
can run the flooding protocol, i.e.. Algorithm 2, with 
d = e(log^ n) and A = 0(log^ n) (or A = 0(log^ n) if 
we only use newly established links). The running time 
for one iteration is thus 0(log^ n) (or 0(log^ n)) rounds 
according to Lemma [XT] Over at most 4 logn iterations 
this sums up to a total of O(log^n) (or O(log^n)) 
rounds. 



4 The New Deterministic 1-Local Broadcast 
Gossip Protocol 

In this section we give the simplest description of our 
deterministic gossip protocol for the 1-local broadcast 
problem. It is identical to Algorithm 2 presented in the 
last section except for the following three simplifications 
(marked in bold in Algorithm 3): 

• Instead of 8(log^ n) new edges per round only one 
new edge is added per node. 

• Instead of 9(log^ n) hops messages are only flooded 
for 2 log n hops. 

• Most importantly, the new edge(s) are not anymore 
required to be chosen uniformly at random but can 
be chosen in any arbitrary (deterministic) way. 

Surprisingly, we will show next that this severely 
stripped down algorithm still takes only log n iterations 
to solve the 1-local broadcast problem (now determin- 
istically and always instead of with high probability) . 

As already described, our deterministic gossip pro- 
tocol does the following for each node w e y in parallel: 



Algorithm 3: Deterministic Gossip 

— V 

WHILE r(u) \ i?^ ^ 

arbitrarily pick one new edge to r(w) \ _R„ 
d = 1 log n; E' — all established links 
Flood rumors in along £"-edges for d-hops 
add all received rumors to i?„ 



Lemma 4.1. Algorithm 3 takes at most logn iterations 
and solves the 1-local broadcast problem in at most 
2 log'^ n rounds. 

We first need to define binomial trees for our 
analysis: 

Definition 4.1. A binomial tree of order 2^ or short i- 
tree is a rooted depth i tree on 2* nodes that is inductively 
defined as follows: A 0-tree consists of a single node. 
For any i > a i + 1-tree is formed by taking two i- 
trees, connecting their roots and declaring one of the 
roots as the new root. 

To prove Algorithm 3 efficient we will use a short 
and simple inductive argument that for any node that 
has not terminated at iteration i it is possible to find a 
j-tree rooted at it in G. Since i-trees grow exponentially 
in i this limits the number of iterations to log n. 
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Figure 1: i-trees for i e {0, 1, 2, 3}. 



Lemma 4.2. Consider the beginning of any iteration 

< i < \ogn in Algorithm 3 and Hi he the graph of all 
edges used until then. Suppose there is a node vq with 
k missing rumors, that is, Tcivi)) \ Ryg = {vi, . . . ,Vk}- 
Then there are k + 1 many i-trees tq, . . . , as subgraphs 
in Hi rooted at vo,vi, ... ,Vk respectively such that tq is 
vertex disjoint from Tj for any < j < k. 

Proof. Wc proof the lemma by induction on t. The base 
case for i = follows directly from the fact that each 
node forms its own 0-tree. For the inductive step we 
assume a vertex vq which at the beginning of iteration 

1 + 1 < log n is still active. Let uq be the vertex 
contacted by vq in iteration i. By induction hypotheses 
in the beginning of iteration i there was an i-tree rooted 
at Vq and a vertex disjoint i-tree rooted at uq. These 
two trees together with the new edge {vq, uq} form the 
new i + 1-tree tq in 

Next we note that the symmetry of flooding for d 
hops ensures that whenever an ID a gets added to i?f, the 
ID b also gets added to Ra. Therefore if FgCwq) \ Rv,, — 
{vi, . . . , Vk} at the beginning of iteration i + 1 it must 
be that all vj £ rG{vo)\Ro also have vq G Fg (wj ) \ i?!,^ . 
Every node vj € Tg(vo) \ Rq was therefore also active 
at the beginning of round i and must have chosen an 
edge to a node Uj. Similarly as done for vq we can find 
an i + 1-trce tj that consists of the i-trees rooted at 
Vj and Uj at iteration i and the edge {vj,Uj}. It only 



remains to show that tq and Tj are node disjoint for all 
j. Assume for sake of a contradiction that there is a j 
such that To and Tj share a node. In this case there is a 
path in Hi+i from vq to Vj of length at most 21ogn as 
the depths of both tq and Tj is at most log n. But iJi+i 
is the graph along which IDs and rumors are flooded for 
2 log n hops during iteration i. Thus Vj would be in 
at the beginning of iteration i + 1 - this is the desired 
contradiction that completes the proof. 



Proof. [Proof of Lemma 4.1 Algorithm 3 correctly 
solves the 1-local broadcast problem by construction 
since it keeps contacting new neighbors until it has 
received the rumors from all of them. Lemma 14.21 
furthermore proves that if the protocol is not done after 
the iteration log n at the beginning of the next iteration 
we can find two neighbors that do not know of each 
other and two node-disjoint (log rt)-trees as subgraphs 
of G. Since the number of nodes is n this is impossible 
and shows that Algorithm 3 performs at most logn 
iterations. In each iteration at most one new link 
is established per node for a total of at most logn 
links. Flooding for d — 21ogn hops in the graph of 
all established links using Algorithm 2 with A — \ogn 
thus takes 2 log^ n rounds. Over log n iteration this 
accumulates to 2 log"^ n rounds in total. 

5 More Efficient Broadcast Protocols 

In this section we show how to tweak the deterministic 
gossip protocol from Section [4] to achieve faster fc- local 
broadcast protocols. In particular in this section we 
prove Theorem 1 1.1 1 by showing how to solve the fc- local 
broadcast problem deterministically in 2(fc + logn) logn 
rounds. 

5.1 Faster 1-Local Broadcast via Deterministic 
Tree Gossip First we speed-up our solution for the 1- 
local broadcast problem by replacing the flooding step 
in Algorithm 3. We use two key observations: 

• The flooding steps in Algorithm 3 are only per- 
formed to ensure that in every iteration i any node 
Vq picks a new neighbor whose i-tree does not in- 
tersect with its own i-tree. 

• The structure of these i-trees allows for spreading 
rumors within the trees faster than using the round 
robin flooding procedure. 

To better understand the structure of the i-trees 
constructed in the proof of Lemma |4.2| we will give an 
alternative construction. For any node vq that has not 
terminated until iteration i we construct its i-tree tq as 
follows: 



The root of tq is vq and its children are the nodes 
ui, . . . ,Ui-i contacted by vq in the iterations up to i. 
For each of these child nodes Ui' we then attach as 
children all nodes wi, . . . , Wi'-i contacted by node Uii 
in the iteration up to i'. We continue inductively for 
each of these nodes Wi" . 

It is easy to see that this produces the same i-tree 
as the one constructed in the proof of Lemma |4.2| In 
addition to being helpful for our proofs this construction 



has a nice interpretation. The z-tree tq in Lemma 4.2 
can be seen as a witness structure that certifies and 
explains why node vq was active until iteration i, 
namely: 

Node Vq did not terminate until iteration i because 
there were the neighbors ui,. . . that were active 

and unknown to Vq at time 1 to i — 1 respectively 
resulting in vq contacting them. Each of these nodes 

on the other hand was still active and did not 
contact vo itself until iteration i' because of its neighbors 
wi,. . . ,Wi>-i that were active and unknown to m,' at 
time 1 to i' — 1 respectively resulting in Ui' contacting 
them, and so on. 



With this interpretation the proof of Lemma 4.1 
essentially says that Algorithm 3 cannot have an node 
Vq that is still active after log n iterations since any 
explanation tq for why it is still active would have to 
blame more nodes than exist in the network. 

Next we will show how to exploit the structure of 
these i-trees. The deterministic gossip protocol that 
does this performs the following for each node v E V in 
parallel: 



Algorithm 4: Deterministic Tree Gossip 

FOR i = 1 UNTIL T{v) \ i? = 

link to any new neighbor ut G r(w) \ TV 
R' = V 

PUSH: For j = i downto 1: 

exchange rumors in R' with uj 
add all received rumors to R' 

PULL: For j = I to i: 

exchange rumors in R' with uj 
add all received rumors to R' 

R" 

perform PULL, PUSH with R" 
R = R'U R" 



Remark: 

Note that Algorithm 4 does not require knowledge of the 
network size n. This can also be achieved in Algorithm 
3 if one instead of flooding for 2 log n hops in each 



iteration only floods for 2i hops in the ith iteration. 
This also speeds up the running time by a factor of two 
and avoids the guess-and-double strategy remarked in 
Section |2. II for the case that n is unknown. 



Theorem 5.1. Algorithm 4, solves the 1-local broadcast 
problem in logn iterations and less than 2(logn + 1)^ 
rounds. 

The proof is essentially the same as for Lemma [4T] 
except for the use of the following lemma: 

Lemma 5.1. Suppose u and v are two nodes that are 
active at the beginning of iteration i and that r„ and 
Ty are their ordered i-trees respectively. Then, after 
the PUSH exchanges in iteration i all nodes in have 
learned about u ( and all nodes in Ty have learned about 
v). Furthermore, ifr^ andr^ are not node disjoint then 
after the PUSH-PULL exchanges in iteration i the node 
u has learned about v (and vice versa). 

Proof. We note that by construction all paths from the 
root 7i to a any other node in follow tree edges 
in decreasing order. During the first PUSH sequence 
edges are activated in decreasing order which thus 
pipelines the rumor of u from the root to all nodes 
in T„. Similarly, rumors known to any nodes in t„ 
gets pipelined towards the root u during the first PULL 
sequence. The same is true by symmetry for v and Ty. 
Now, if Tu and Ty share a node y then y will learn about 
u and v during the first PUSH sequence of iteration i 
and then forward this information to u and v during the 
first PULL sequence - informing both nodes about each 
other. 



Proof. [Theorem 5.1 We use the exact statement as 
in Lemma |4.2| to show that only log n iterations are 
performed by Algorithm 4. The first thing to check is 
that Algorithm 4 still maintains symmetry with regards 
to which nodes knows which. This is achieved by the 
the fact that the PULL-PUSH sequence of the second 
part is exactly the reversal of the PUSH-PULL sequence 
of the first part of any iteration. As such, if a node v 
learns about a node u in the PUSH-PULL sequence of 
exchanges then u will learn about v during the PULL- 
PUSH exchanges and vice versa. By taking R to be 
the union of R' and R" it is clear that we indeed get 
u E Ry ii and only if w G Ru- The rest of the proof of 



Lemma 4.2 goes exactly the same except that we relay 
on Lemma |5.1| for the fact that if vq and Vi do not know 
about each other then their i-trees must be disjoint. 
This completes the proof that at most logn iterations 
are performed. To determine the total number of rounds 
needed we note that in the ith iteration exactly 4i 




Figure 2: A 5-tree. The labels on the edges denote at 
what time they were added. These edges are activated 
from low to high during a PUSH exchange and activated 
from high to low in a PULL exchange. Note that 
the path from the root to any node follows edges in 
decreasing order. 



exchanges are performed. This leads to a total round 
complexity of 4* = 2 log n (log n + 1). 

5.2 Speeding up the fc-Local Broadcast for fc > 1 

What remains to show for Theorem ll.ll is how to use our 
algorithms for the /c- local broadcast problem for fc > 1. 



As remarked in Section 2.2 a straight forward way to 
use Algorithm 2 - 4 for the fc-local broadcast problem 
is simply applying them k times. Using our Algorithm 
4 this leads to a log^ n solution. We can improve 
upon this by realizing that in all algorithms the 1-local 
broadcast problem is actually solved completely during 
the flooding or tree-broadcast in the last iteration. All 
prior iterations are only needed to guarantee that the 
links chosen are going to new neighbors. To exploit this, 
we only use the full blown algorithms for the first 1- 
local broadcast and then reuse the established links by 
simply repeating the flooding or tree-broadcast of the 
last iteration for the remaining local broadcasts. When 
using Algorithm 4 for this we need 2 log n (log n + 1) 
round for the first local broadcast and only 21ogn 
rounds for each of the fc — 1 remaining ones. This leads 
to the running time claimed in Theorem 



6 Natural Gossip Processes and Robustness 

In addition to their message and time efficiency gossip 
algorithm have also been studied because of their natu- 
ralness and robustness. 

Unfortunately, while the algorithms in [H |3] deal 
much better with bottlenecks in the topology than the 
uniform gossip protocol they are also much less natural 
and robust. This holds in particular for the local 
broadcast algorithm of [3] which crucially relies on a 
very unnatural reversal step to guarantee correctness. 
This algorithm can furthermore fail completely if only 
one link is temporarily down in one round or if due 
to a slight asynchrony the order of two exchanges gets 
switched. 

In this section we show that the algorithms and 
analysis presented in this paper are robust and can be 
seen and phrased as a natural process. The latter is 
best demonstrated by considering as an example the 
following surprisingly accurate/realistic social setting: 

Interpret nodes as curious but somewhat shy per- 
sons that want to know (everything) about their neigh- 
bors but only rarely have the courage to approach a per- 
son they know nothing about; instead they prefer to talk 
to neighbors they have already approached before and ex- 
change rumors/information about others. 

We will show that our analysis is flexible enough to 
not just show fast rumor propagation for our specifically 
designed algorithms but broadly covers a wide variety 
of high level processes including the one above. In par- 
ticular for this setting our analysis implies the follow- 
ing: Rumors spread rapidly in such a social setting as 
long as a person talks (by a factor of log^ n) more fre- 
quently to a person approached before compared to ap- 
proaching a new neighbor (whose rumor is not known) . 
Interestingly, our result is flexible enough to allow the 
(social) process according to which nodes choose which 
new person they find most approachable at any time to 
be arbitrarily dependent and complex. 

To show the flexibility of our analysis we consider 
the template given as Algoithm 5. It is coined in terms 
of iterations each consisting of a linking step and a 
propagation step. 



Algorithm 5: Gossip Template 
REPEAT 

link to any neighbor whose rumor is not known 
propagate rumors among established links 
UNTIL all rumors are known 



To analyze this template we introduce some nota- 



tion: Let Rv{t) C r{v) be the neighbors of node v it 
knows the IDs and rumors of at the beginning of iter- 
ation t. Furthermore let Gt be the undirected graph 
that consists of all edges added in the linking proce- 
dure. With this we first show a strong but admittedly 
relatively technically phrased lemma. 

Lemma 6.1. Let Tmin be the maximum number of prop- 
agation steps it takes for a nodes u to learn about an- 
other node V for which there exists a path of estab- 
lished links of length at most 21ogn. Furthermore, for 
any two nodes u, v let Tdi ff be the maximum num- 
ber of propagation steps it takes from the time that u 
knows V until v also knows u. With these two parame- 
ters Algorithm 5 takes at most Tlogn iterations, where 
T = max{rdi//,ri„in}. 

Remark: 

The algorithm in 3J and all algorithms presented in 
this paper so far keep perfect symmetry of knowledge, 
that is, if a node u knows the rumor of v then v 
also knows the rumor of u. This is achieved by 
flooding for exactly d hops or by carefully reversing the 
sequence in which edges where chosen. This symmetry 
is used in all proofs so far. If furthermore turns out 
to be crucial for the efficiency of the algorithm of 
[3]: Indeed, [3] gives an example in which introducing 
a slight asymmetry increases the running time from 
polylogarithmic to linear. The expander decomposition 
proof of [3] seems furthermore unsuitable for extensions 
to a more asymmetric setting. While our algorithms 
so far also featured perfect symmetry Lemma [6T] shows 
that both algorithms and our analysis are robust enough 
to relax this requirement significantly. 



While Lemma RO shows robustness it does not read 
too natural or close to the informal description given 
before. This is remedied by the next two corollaries 
which give some examples on how it can be used. 

Corollary 6.1. Suppose a rumor dies out / looses 
credibility (i.e., is not forwarded anymore) after it has 
been passed around for more than A hops where X > 
logn. Suppose also that nodes establish links at least 
every a steps and talk to each established link at least 
every (3 steps. Then the 1-local broadcast completes after 
at most a/3X steps. 

Or on an even more concrete example: 

Corollary 6.2. Suppose a rumor dies out after 
0{\ogn) hops. Suppose also that each node indepen- 
dently chooses with probability p — 1/log^n to talk 
to a neighbor it has not heard from yet and otherwise 



talks to a random neighbor it has already contacted be- 
fore. In this setting the 1-local broadcast stops after 
T ~ O(log^n) iterations with high probability. 

Given the above demonstrated fiexibility it comes 
as no surprise that our algorithms are also naturally 
robust against various kinds of failures. In particular it 
is easy to give robust deterministic algorithms for the 
fc-local broadcast based on Lemma [6?H For example, a 
simple round robin flooding procedure (with or without 
distance labels to prevent rumors from spreading too 
far) is naturally robust against any random edge failure 
rate 7 with only the necessary 1/(1 — 7)-slowdown. Even 
adversarial permanent failures merely slow down the 
algorithm slightly as they can not cause more harm then 
preventing progress made in the iteration the failed edge 
occurred. We defer the formalization and presentation 
of these robustness results to the journal version of this 
paper and instead point to the very nice work of Keren- 
Censor Hillel and Giakkoupis There, independently, 
a robust randomized alternative to [3] is given including 
proofs for robustness against random temporary and 
random permanent node- and edge-failures. 



6.1 Proofs 



Proof, [of Lemma 6.1 We 



will only 
mod T. 



look at links 
We define Hi 



established at iteration i 
to be subgraph of G consisting of all edges established 
in these iterations until iteration iT. We then prove 
Lemma [4. 2 1 for Algorithm 5 exactly as before. That is: 



Consider the beginning of any iteration iT for 
i < logn in Algorithm 5 and any active node vq. If 
r(?;) \R — {vi , ■ . ■ , Vk} then there are fc-f- 1 many t-trees 
tq, . . . , Tfe as subgraphs in -ff^+i rooted at vq,vi, . . . , 
respectively such that tq is vertex disjoint from for 
any Q < i <k. 

We proof the lemma by induction on i. The base 
case for z = follows directly from the fact that each 
node forms its own 0-tree. For the inductive step we 
assume a vertex vq which at the beginning of iteration 
{i -\- 1)T for I -I- 1 < logn is still active. Let uo be the 
vertex contacted by wo in iteration iT . By induction 
hypotheses in the beginning of iteration iT there was a 
i-tree rooted at vq and a vertex disjoint z-tree rooted at 
uq. These two trees together with the new edge {w, w} 
form the new i -\- 1-tree tq. 

All neighbors Vi of uq that are in T{v) \ i? at 
beginning of iteration (i -f- l)r can not have know about 
Vq at the beginning of iteration iT since otherwise vq 
would know about Vi at iteration iT -\-Tdiff < (i + \)T. 
They were therefore also active at the beginning of 



iteration iT and must have chosen an edge to a node 
Ui. Similarly as done for wq we can find an i + 1-tree 
Ti that consists of the i-trees rooted at Vi and Ui at 
iteration i and the edge {vi,Ui\. It is clear that tq and 
Ti are node disjoint since otherwise there is a path from 
Vi to vq of length at most sum of the depths of tq and 
Ti in Hi^i. This is at most 21ogn which implies that 
in the beginning of iteration iT + T^in < + 1)7" the 
node Vq would know about Vi - a contradiction. 



Proof, [of Corollary 6.1 In this case we get that one 
iteration corresponds to a steps. We have Tdi// = A/3 
since if u is informed about v then there is a path of 
length at most A and it takes at most (5 rounds per step 
until V also knows about u. 



Proof, [of Corollary 6.2 First we note that with high 
probability every node contacts a new neighbor within 
any a — 0(log'^ n) rounds. Furthermore, with high 
probability the number of established neighbors during 
0(log'' n) rounds is at most 0(log^ n) for every node. 
Thus, if a path of established links of lengths at most 
0(logn) occurs between two node u and v then after 
0(log'^ n) steps both nodes will have learned from each 



other with high probability. According to Lemma 6.1 
it takes thus at most max{0(log'^ n), 0(log^ n)} logn — 
0(\og^ n) steps to complete the 1-local broadcast. 

7 Conclusion 

In this paper we presented the first efficient determin- 
istic gossip algorithm for the rumor spreading problem 
and the /c-local broadcast problem. In addition to show- 
ing that all random choices of a certain gossip algorithm 
can be replaced by arbitrary deterministic choices our 
algorithms are also much simpler, more robust, more 
natural and with a running time of 2(fclogn -|- log^ n) 
faster than previous randomized algorithms. 

One interesting question that remains is whether 
the running time of 0(log^ n) for the 1-local broadcast 
problem (or the log n-local broadcast problem) can be 
improved. While we believe that our running time is 
optimal at least for deterministic algorithms we could 
not prove such a lower bound. One direction for proving 
lower bounds even for randomized algorithms would be 
to use the connections to spanners given in [3]. In 
particular any gossip algorithm solving, e.g., the logn- 
local broadcast problem in T = log^^*^ n rounds implies 
the existence of an (a, /3)-spanner with a = Q(\og^ n) 
and /3 — log^^* n with density n log^^* n. No spanner 
of such quality is known to exist for (5 < 1 [17] . 

The connections to spanners from [3] can also be 
used to in the opposite direction to see our gossip 
algorithms for the 1-local broadcast problem as an 
extremely efficient distributed constructions of a sparse 



graph spanner. For example the subgraph of edges used 
by Algorithm 3 or 4 both form a spanner with stretch 
2 logn and n\ogn edges. This is almost optimal as 
8 (logn/ log logn) is the best stretch achievable with 
this density. While constructions of optimal spanners 
are known, the simple distributed construction of a good 
quality spanner in the extremely weak gossip model 
should be of interest. In particular the total number of 
messages used in Algorithm 4 is with n log^ n drastically 
less than any distributed algorithm in the literature 
since these LOCAL-algorithms send a messages over 
each of the potentially 9(n^) many edges in G in 
every round. Similarly, the total amount of information 
exchanged by our algorithm is quite low given that 
there are no distributed algorithms that use less than 
polynomial size messages [TS]. Even the 41og^n round 
construction time is quite fast given the restricted 
model. We remark that our construction can also 
be seen as a deterministic 2 log n round algorithm in 
the LOCAL model since nodes only communicate with 
other nodes in their 2 logn neighborhood. All this, 
together with the simplicity of our algorithm and its 
analysis makes our approach an interesting starting 
point for designing new spanner constructions. One 
question in this direction would for example be: If 
one floods only for k instead of 2 logn hops during 
Algorithm 3 then a fc-stretch spanner is computed. 
What is the density of this spanner when k < log n? 

Another question is whether the message size re- 
quirement can be reduced. While, as remarked in Sec- 
tion [2j this is not reasonable to ask for in the local 
broadcast problem itself the trivial information argu- 
ment does not apply to the bit complexity for discov- 
ering a sparse spanner via a gossip algorithm. Never- 
theless, reducing the message size seems like a hard and 
quite possibly impossible task. As just mentioned even 
for the intensely studied case of constructing spanners 
via randomized algorithms in the less restrictive LO- 
CAL model no algorithms using a subpolynomial mes- 
sage size are known [18) . 
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